Efficient Acronym-Expansion Matching for Automatic Acronym Acquisition
نویسنده
چکیده
Acronyms are a very dynamic area of many languages. An efficient dynamic programming algorithm for matching acronyms with their expansions by maximizing a linguistic plausibility score is presented and is found to be very accurate, to =99.6% on a corpus of acronym definitions. Given its high precision, the algorithm can be used as a component in new or existing automatic acronym acquisition systems.
منابع مشابه
A (acronyms)
Acronyms are a significant and the most dynamic area of the lexicon of many languages. Building automated acronym systems poses two problems: acquisition and disambiguation. Acronym acquisition is based on the identification of anaphoric or cataphoric expressions which introduce the meaning of an acronym in text; acronym disambiguation is a word sense disambiguation task, with expansions of an ...
متن کاملAutomatic Acronym Acquisition and Term Variation Management within Domain-Specific Texts
In this paper we present a framework for the effective management of terms and their variants that are automatically acquired from domain-specific texts. In our approach, the term variant recognition is incorporated in the automatic term retrieval process by taking into account orthographical, morphological, syntactic, lexico-semantic and pragmatic term variations. In particular, we address acr...
متن کاملManaging the Acronym/Expansion Identification Process for Text-Mining Applications
This paper deals with an acronym/definition extraction approach from textual data (corpora) and the disambiguation of these definitions (or expansions). Both steps of our global process of acquisition and management of acronyms are precisely described. The first step consists in using markers such as brackets to identify expansion candidates. The alignment of the letters allows to select the ac...
متن کاملProcessus global d'acquisition et de gestion des sigles
This paper deals with an acronym/definition extraction approach from textual data (corpora) and the disambiguation of these definitions (or expansions). Both steps of our global process of acquisition and management of acronyms are precisely described. The first step consists in using markers such as brackets to identify expansion candidates. The alignment of the letters allows to select the ac...
متن کاملAutomatic Building Gazetteers of Co-referring Named Entities
Noun phrase (NP) co-reference resolution is a problem involved in many Natural Language areas, such as Dialog, Information Extraction, Summarization and Question Answering, among others. Especially important issues regarding this problem are the detection of aliases and the detection and expansion of acronyms. In this sense, terminological and general gazetteers of Named Entities (NEs) being al...
متن کامل